Image quality assessment to merchandise an item

ABSTRACT

Image-based features may be significantly correlated with click-through rates of images that depict a product, which may provide a more formal basis for the informal notion that good quality images will result in better click-through rates, as compared to poor quality images. Accordingly, an image assessment machine is configured to analyze image-based features to improve click-through rates for shopping search applications (e.g., a product search engine). Moreover, the image assessment machine may rank search results based on image quality factors and may notify sellers about low quality images. This may have the effect of improving the brand value for an online shopping website and accordingly have a positive long-term impact on the online shopping website.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a continuation of U.S. patent application Ser. No.14/180,255, filed on Feb. 13, 2014, which is a continuation of U.S.patent application Ser. No. 13/300,305, filed on Nov. 18, 2011, whichclaims the benefit of U.S. Provisional Patent Application No.61/415,177, filed Nov. 18, 2010, the benefit of priority of each ofwhich is claimed hereby, and each of which are incorporated by referenceherein in its entirety.

TECHNICAL FIELD

The subject matter disclosed herein generally relates to the processingof data. Specifically, the present disclosure addresses systems andmethods involving image quality assessments to merchandise an item.

BACKGROUND

A product may be available for purchase from a seller, and the sellermay seek to merchandise one or more items as instances of the product.The product may take the form of a good or a service. Examples of goodsinclude physical items (e.g., a digital camera or a car) and informationitems (e.g., downloaded data). Examples of services include humanservices (e.g., contracted work) and automated services (e.g.,subscriptions). Other examples of products include authorizations (e.g.,access to services, licenses, or encryption keys).

In merchandising an item, the seller may use a network-based system topresent an advertisement of the item to a user of the network-basedsystem (e.g., a potential buyer of the item). Examples of network-basedsystems include commerce systems (e.g., shopping websites), publicationsystems (e.g., classified advertisement websites), listing systems(e.g., auction websites), and transaction systems (e.g., paymentwebsites). Examples of advertisements include banner advertisements,sidebar advertisements, pop-up advertisements, and tool tips.Furthermore, an advertisement of the item may take the form of a searchresult referencing the item, a listing for the item (e.g., within a listof items available for purchase), a review of the item, a comment on theitem, or any suitable combination thereof.

BRIEF DESCRIPTION OF THE DRAWINGS

Some embodiments are illustrated by way of example and not limitation inthe figures of the accompanying drawings.

FIG. 1 is a diagram of a webpage containing advertisements for items,according to some example embodiments.

FIG. 2 is a diagram illustrating images of varying quality, according tosome example embodiments.

FIG. 3 is a network diagram illustrating a network environment suitablefor image quality assessment to merchandise an item, according to someexample embodiments.

FIG. 4 is a block diagram illustrating components of an image assessmentmachine suitable for image quality assessment, according to some exampleembodiments.

FIG. 5-8 are flowcharts illustrating operations of an image assessmentmachine in performing a method of image quality assessment tomerchandise an item, according to some example embodiments.

FIG. 9 is a block diagram illustrating components of a machine,according to some example embodiments, able to read instructions from amachine-readable medium and perform any one or more of the methodologiesdiscussed herein.

DETAILED DESCRIPTION

Example methods and systems are directed to image quality assessment tomerchandise an item. Examples merely typify possible variations. Unlessexplicitly stated otherwise, components and functions are optional andmay be combined or subdivided, and operations may vary in sequence or becombined or subdivided. In the following description, for purposes ofexplanation, numerous specific details are set forth to provide athorough understanding of example embodiments. It will be evident to oneskilled in the art, however, that the present subject matter may bepracticed without these specific details.

An advertisement for an item may include an image of the item (e.g., adigital photograph depicting the item). A network-based system mayenable a seller of the item to create an advertisement for the item(e.g., using a user interface provided by the network-based system to aclient device of the seller) by associating (e.g., by inclusion or byreference) an image of the item with the advertisement. For example, theseller may upload the image to the network-based system for inclusion inthe advertisement. As another example, the seller may provide thenetwork-based system with a reference (e.g., a pointer or an address) tothe image, which may be stored in a database accessible by thenetwork-based system.

In online shopping implementations, for example, a potential buyer mayview an item by looking at an image of the item (e.g., an image includedwithin an advertisement for the item). The item may be significantlycharacterized by its visual appearance (e.g., clothing, shoes, orjewelry). Accordingly, a seller, a potential buyer, or both, may desirethat an image of an item reveal as much about the item as possible.

Images, however, may vary in visual quality, and the visual quality ofan image may influence the effectiveness of an advertisement thatincludes the image. For example, the effectiveness of an advertisementmay be indicated by its click-through rate (e.g., a number of clicks onthe advertisement within a period of time or a percentage of displays ofthe advertisement that correspond to clicks on the advertisement).Accordingly, the effectiveness of an advertisement may be enhanced byinclusion of a high quality image, as compared to a low-quality image.

An image assessment machine may assess (e.g., analyze or compute) one ormore characteristics of an image. The image may be submitted by theseller for inclusion in an advertisement or may be already included inan advertisement. A characteristic of the image may also be described asa feature of the image or an attribute of the image. The imageassessment machine may be implemented as a general-purpose computersystem (e.g., having at least a processor and a memory) configured as aspecial-purpose computer system by special-purpose software to performany one or more of the methodologies described herein, in whole or inpart.

Examples of image characteristics include brightness, contrast,saturation, colorfulness, and dynamic range. An indication of brightnessmay be an average grayscale intensity of an image. An indication ofcolorfulness may be a difference between a color as compared tograyscale. An indication of saturation may be an average level of color(e.g., in red-green-blue (RGB) color space). An indication of dynamicrange may be a range of grayscale intensity from minimum intensity(e.g., darkest portion of the image) to maximum intensity (e.g.,lightest portion of the image). An indication of contrast may be a ratioof a range of intensities to a sum of the maximum and minimumintensities. Another indication of contrast may be a standard deviationof intensities throughout the image.

Other examples of image characteristics include size factors (e.g.,area, aspect ratio, height, or width). Moreover, characteristicsassessed by the image assessment machine may include factors used insegmenting a foreground shape from its background within an image (e.g.,foreground uniformity, background uniformity, image colorfulness, orforeground colorfulness). Where the image is segmented, the imageassessment machine may calculate a ratio of foreground pixels tobackground pixels within the segmented image.

Based on assessing the characteristics of the image, the imageassessment machine may determine one or more scores (e.g., numericalscores) that represent the characteristics. For example, the imageassessment machine may determine an average brightness, a peakbrightness, an average contrast, a peak contrast, a brightness L1distance, a contrast L2 distance, a standard deviation of lightness(e.g., distance away from white in RGB color space), an averagelightness (e.g., in RGB color space), a lightness of the background ofthe image, a score representing uniformity of background intensities(e.g., as approximation of texture in the background), or any suitablecombination thereof.

The one or more scores may be represented as one or more vectors (e.g.,a feature vector). For example, a feature vector may be determined for aparticular image, and feature vectors for multiple images may beclassified (e.g., by being processed with a classifier module of theimage assessment machine) based on their probabilities of being “goodquality” images (e.g., images that are likely to enhance theclick-through rate of an advertisement). The classifier may determine anoverall score (e.g., an overall quality score) for each image, based onone or more feature vectors of each image, where the overall scorerepresents a relative probability of being a “good quality” image.

As examples of high quality images, images describable as “professionalquality images” tend to have mostly white or light backgrounds, and thebackground tends to be uniform. In the foreground, the item may stronglycontrast with the background, hence facilitating segmentation of theforeground from the background. The edge of the item may be continuous,and the image may be free from compression-related artifacts (e.g.,blur, contouring, mosquito noise, or blocking). Moreover, the image maybe taken from an appropriate direction and under proper lighting thatpresents the item well. Furthermore, the depiction of the item in theimage tends to have a reasonable size (e.g., relative to the image as awhole) and tends to be in focus. In addition, such images are likely tohave a reasonable aspect ratio (e.g., within a predetermined thresholddistance from 1).

In contrast to high quality images, medium quality images (e.g., “OK”images) are not “professional quality,” but may be considered reasonablyacceptable by most people (e.g., users of the network-based system). Thebackgrounds may be moderately colored or textured. Contrast between theforeground and the background may be moderate. The edge of the item maybe discontinuous (e.g., item is shown as truncated or “chopped off” inthe image). Compression-related artifacts may be present in the image,and the image may be taken from a sub-optimal direction that presentsthe item adequately but not well.

Images may be described as low quality where the images have darkbackgrounds (e.g., having mostly dark colors) or background patternsthat are distracting, confusing, or ugly. The foreground and thebackground may have little contrast between them. The image may have anunusual or inappropriate aspect ratio (e.g., ratio of height to width).The depiction of the item in the image may be unreasonably small orunclear. The image itself may be unreasonably small (e.g., inresolution). A low quality image may have multiple depictions of thesame product in the image.

Using one or more scores (e.g., an overall quality score) determinedfrom the image, the image assessment machine may perform one or moreoperations pertinent to merchandising the item. For example, based onone or more scores for the image, the image assessment machine maynotify the seller that the image is of low quality and prompt the sellerto provide a higher quality image of the item. The image assessmentmachine may suggest or provide one or more alternative images (e.g.,stored in a database accessible by the image assessment machine) to theseller.

As another example, the image assessment machine may raise a search rankof an advertisement that has a high quality image, or lower a searchrank of an advertisement that has a low quality image, such that searchresults presented to a user are biased in favor of advertisements withhigh quality images. The image assessment machine may use one or moremachine-learning techniques to perform analysis of the effectiveness ofadvertisements that are presented as search results. Based on thisanalysis, the image assessment machine may adjust its raising orlowering of search ranks.

As a further example, the image assessment machine may segment the image(e.g., into a foreground portion and a background portion) and determinethe shape of the item depicted in the image. Where the advertisement ofthe item includes textual information describing the item (e.g., title,description, summary, price, product name, or manufacturer), the imageassessment machine may analyze the textual information based on theshape of the item. For example, a seller may upload an image of a pairof pants and erroneously submit textual information describing the imageas depicting “a hat.” The image assessment machine, after segmenting theshape of the pair of pants from the image, may perform semantic analysisto distinguish images that are more informative, more aestheticallypleasing, or both. For example, the image assessment machine may accessa database that correlates that particular shape with the word “pants”but does not correlate that particular shape with the word “hat.” Basedon the accessing of the database, the image assessment machine maynotify the seller of the potential error (e.g., “The item in the imagedoes not look like a hat.”).

In yet another example, the image assessment machine may determine,based on the shape of the item, that the image depicts an item thatmatches the textual information describing the item (e.g., the imageactually shows a hat) and also determine that the image contains a poordepiction of the item. Accordingly, the image assessment machine maynotify the seller that the image is a low quality image that mayadversely impact merchandising of the item (e.g., “The image of the hatis confusing,” or “it is hard to see the hat in the image youuploaded.”).

In a further example, the image assessment machine may identify orsuggest, based on the shape of the item depicted in the image, apotential problem with the image, a potential suggestion for taking abetter image, or both. The image assessment machine may notify theseller that the image has been truncated (e.g., clipped), rotated, orskewed, or that the item is shown in a bad pose (e.g., folded) or shownwith bad lighting.

In addition, the semantic analysis may be performed by the imageassessment machine to identify (e.g., recognize) the item depicted inthe image (e.g., among multiple objects depicted in the image). Forexample, where several objects are depicted in an image, but the textualinformation describes the image as depicting “a hat,” the imageassessment engine may identify a particular hat-like object as being themost relevant object in the image. Accordingly, the image assessmentengine may determine that the hat-like object is the item shown in theimage.

In some example embodiments, the image assessment machine may identify(e.g., recognize) a style of the item based on the shape of the itemdepicted in the image. As an example, where the item is a dress, theimage assessment machine may recognize that the dress is a “sleeveless”dress, a “low-cut” dress, or a dress with an “A-line skirt,” based onthe shape of the dress as depicted in the image. The style of the itemmay be used or suggested by the image assessment machine as furthertextual information (e.g., a tag) describing the item.

Moreover, the semantic analysis may include detection of text in theimage (e.g., product name, brand name, manufacturer name, or designername). For example, optical character recognition (OCR) techniques maybe used by the image assessment machine to recognize textual informationin the image (e.g., human-readable or machine-readable). Similarly, thesemantic analysis may include detection of a watermark (e.g., text orlogo) in the image. Also, the image assessment machine may detect ahuman model depicted in the image (e.g., face, eyes, nose, mouth, orhand). The textual information from an advertisement (e.g., as submittedby a seller) may be compared to the detected text, watermark, humanmodel, or any suitable combination thereof. The image assessment machinemay perform this comparison and accordingly suggest or provideadditional textual information for inclusion in the advertisement.

In some example embodiments, the image assessment machine identifies theshape of undesired overlays in the image. An overlay in the image may beundesired by the seller, a potential buyer, an operator of thenetwork-based system, or any suitable combination thereof. Examples ofundesired overlays include vulgar or offensive graphics or text,information redirecting potential buyers to a competitor of thenetwork-based system, and information redirecting potential buyers to acompetitor of the seller. The image assessment machine may identify theshape of an undesired overlay by segmenting the image (e.g., intoforeground and background portions). According to certain exampleembodiments, this may be performed without the use of OCR techniques(e.g., without recognizing the meaning of undesired text).

According to various example embodiments, the image assessment machinerecognizes multiple images that depict the same item. This recognitionmay be based on the shape of the item depicted in the images. Moreover,one or more color diffusion algorithms may be used by the imageassessment machine to recognize the item in multiple images. Forexample, two images of the same product may have different histograms(e.g., as a result of being photographs taken under different lightingconditions). The image assessment machine may use one or more colordiffusion algorithms to allocate energy represented in a histogram intoone or more partitions (e.g., bins) in a manner that spreads (e.g.,blurs) the energy among neighboring partitions. The number of partitionsused may vary among certain example embodiments, some of which use arelatively small number of partitions (e.g., to reduce computationaleffort). Accordingly, the same item may be identified in twodifferent-looking images.

Image-based features may affect a click-through rate for an image, forexample, in the context of a large-scale product search engine. Productsearch engines may use text-based features in ranking products returnedas search results. A product search engine (e.g., incorporating an imageassessment machine, discussed above) may use image-based features inaddition to text-based features. In some example embodiments, theproduct search engine may use a stochastic gradient boosting regressionmodel to model a click-through rate. Based on experimental results,there is a statistically significant correlation between image featuresand click-through rate.

Online shopping constitutes a significant portion of commerce.Convenience may be an advantage of online shopping, but an inability totouch and feel the product may be a drawback. Images therefore may playan important role in online shopping. Good quality images may provide abetter user experience by providing a better idea of the productfeatures and conditions. Good quality images also may improve the brandimage of the seller.

Product search engines may function as gateways for users into the worldof online shopping. Typically, a user submits a query into a productsearch engine, and the product search engine presents the user with asearch results page (SRP). The SRP contains a list of search results.Each entry in the list of search results may include an imageaccompanied by one or more product attributes (e.g., product title,summary, price etc.). The user may look at the image and the textualdata and then make a decision to buy the product or click on the entryfor further information.

Many product search engines are small in scale, and images of productsmay be manually edited (e.g., selected, modified, or approved) beforethe images are used by the system for search results. This may have theeffect of ensuring that high quality images are displayed. However, in alarge scale online market without manual editing of images, the variancein the quality of images may be significant. In such cases, it may bedesirable to incorporate image features into one or more ranking modelssuch that relevant results with good images get ranked more highlycompared to relevant results with bad images. This may serve as anincentive for sellers to make efforts to submit high quality images. Aprevalence of results having high quality images also may driveadditional buyer traffic to the online market, thereby benefiting allsellers. Accordingly, there may be a long-term value in incorporatingimage-based features into a ranking model. For example, use of theimage-based features in the ranking model may gradually improve thebrand value of the product search engine.

Image features may be considered using two classes: global features andregional features. Global features may be computed over an entire image(e.g., a whole image) and may be represented as global statistics overthe entirety of the image. Examples of global features includebrightness and contrast.

Brightness may be representative of an amount of illumination within theimage. Brightness may be computed as a normalized average grayscaleintensity of the image. For example, brightness may be computed from RGBprimaries in an RGB colorspace using the expression:0.3R+0.6G+0.1B

Contrast may be representative of a variation in visual properties thatmakes an object in the image appear clearer to a viewer. Multiple kindsof contrast may be supported by the image assessment machine. An exampleof contrast is root mean square contrast, computed from the followingexpression:

$\sqrt{\frac{1}{M\; N}{\sum\limits_{i = 0}^{N - 1}\;{\sum\limits_{j = 0}^{M - 1}\;\left( {I_{i\; j} - \overset{\_}{I}} \right)}}}$where A M×N is the size of the image, where I is the grayscaleintensity, and where Ī is the average intensity of the image.

Regional features may be non-global. For example, a product image maydetect an object placed against a background. Regional features may becomputed separately for the background and for the foreground object.The image assessment machine may derive information from the foregroundindividually, from the background individually, from comparing theforeground to the background, or any suitable combination thereof. Insome example embodiments, the image assessment machine segments theimage into a background and the foreground. For example, a Grabcut-basedapproach (e.g., an implementation of the Grabcut algorithm) may be usedto segment the image. As noted above, good image quality may facilitateimage segmentation (e.g., to be more accurate or to be lesscomputationally intensive). Examples of regional features includelightness of background, colorfulness of foreground, and ratio ofbackground to foreground.

In many situations, a good product image has a light colored background.The image assessment machine, after segmenting the background from theimage, may measure a quantity computed from the standard deviation andmean of the distance between the intensity of each color pixel and areference color (e.g., white background color). For example, pure whitemay be represented in RGB color space as (255, 255, 255), and an L2 normmay be used by the image assessment machine to compute the distance ofeach pixel intensity from pure white. The expression:αμ+βσmay be used by the image assessment machine, where μ is the mean and σis the standard deviation, and where α and β are constants (e.g., α=1and β=0.3).

Colorfulness of the foreground may be a quantity representative of humanperception of color. Multiple kinds of colorfulness may be supported bythe image assessment machine. For example, the image assessment machinemay implement an expression based on the sRGB colorspace, as set forthin D. Hasler and S. E. Suesstrunk (“Measuring colorfulness in naturalimages,” Proceeding of SPIE Electron. Imag.: Human Vision and Electron.Imag. VIII, 5007:87-95, 2003).

The ratio of background to foreground may be computed by the imageassessment machine by determining the ratio of the area of theforeground to the area of the background. In many situations, a largerratio indicates that the product image size is large within the frame ofthe image.

In experiments, datasets were derived from query logs from an onlineshopping website. The datasets include tuples of the form (query,product identifier, click-through rate). A tuple, therefore, mayrepresent a query submitted by a user, a product identifier, and theclick-through rate for that product. The click-through rate for aproduct may be defined as number of clicks per impression for an item(e.g., per viewing of the image of the item). The datasets were derivedfrom approximately 150,000 pairs of queries and products. In theexperiments, the queries were not chosen randomly. An image assessmentmachine computed a text match score and extracted values of attributesthat are visible in the search results page. The image assessmentmachine then trained a boosted tree-based regression model, usingclick-through rate as a target. Next, the image assessment machine addedimage features into the training dataset.

It was observed that the mean standard error for regression was reduced.Also, a normalized discounted cumulative gain (NDCG) improvement wasobserved, where the NDCG was computed using human judgment data. Table 1summarizes the results obtained from the experiments.

TABLE 1 Regression study with image features Mean Standard Models Error(MSE) NDCG 5 NDCG 10 No image factors 0.0209 0.8092 0.8719 With imagefactors 0.0207 0.8151 0.8761

The target in this case is the click-through rate of the item, given thequery. The results show a slight improvement in NDCG at the 5th positionand at the 10th position. Also, the results show a slight improvement inreducing the mean standard error (MSE) for the regression. Accordingly,the NDCG improvement may be significant where tuples are sampled from aproduct category (e.g., fashion) where images are more important thanother attributes. Regression analysis techniques may provide a rank ofsignificance for one or more features. In the experiments, it wasobserved that the colorfulness of foreground and the root-mean-square(RMS) contrast were the number one and two most significant imagefeatures for ranking.

In another experiment, approximately 1000 items from a fashion-relatedproduct category were studied for a correlation between image featuresand click-through rate. Tables 2 and 3 show the results of the study.

TABLE 2 Correlation study with image features Global features ContrastBrightness ρ 0.14 0.10 p-value 10⁻⁶ 10⁻⁸

In Table 2, ρ is the Spearman's rank coefficient in the table. Alsoshown is the “p-value” for the experiment. It was observed that thelightness of the background is most correlated with click-through rate,which is consistent with the anecdotal marketing wisdom that a productdepicted against a lighter background is better emphasized compared toproduct depicted against darker backgrounds.

TABLE 3 Correlation study with image features Ratio of Lightness ofColorfulness of foreground area to Regional features backgroundforeground background area ρ 0.21 0.15 0.10 p-value 10⁻¹⁶ 10⁻⁴ 10⁻⁶

Tables 2 and 3 show that the image features studied in the experimentare correlated with click-through rate and that the correlations arestatistically significant. It should be appreciated that the importanceof the image features, as ranked by a particular regression model, maybe different from the importance of the image features based on acorrelation study. Moreover, an image assessment machine may computeadditional features from images, beyond the global or regional featuresdescribed herein, and use these additional features in performingassessments of image quality.

Based on the experiments, it was observed that image-based features maybe significantly correlated with click-through rate, which may provide amore formal basis for the informal notion that good quality images willresult in better click-through rates, as compared to poor qualityimages. Accordingly, image-based features may facilitate modelingclick-through rates for shopping search applications (e.g., a productsearch engine). Moreover, ranking search results based on image qualityfactors may provide sellers with an incentive to provide high qualityimages. This may have the effect of improving the brand value for anonline shopping website and, accordingly, have a positive long-termimpact on the online shopping website.

FIG. 1 is a diagram of a webpage 100 containing advertisements foritems, according to some example embodiments. As an example, anadvertisement 110 is an advertisement for pants that are available forpurchase from a seller of the pants. The pants are the item that is tobe merchandised by the advertisement 110. The advertisement 110 includesan image 111 of the item and a description of the item (e.g., “Urbandenim pants, dark green, size 10, by Tommy. Lemongrass”). Thedescription of the item includes textual information 113 (e.g., “pants”)that describes the item or is submitted for inclusion in the descriptionof the item. For example, the textual information 113 may be submittedby the seller of the pants (e.g., one or more seller-submitted tokens,words, or phrases). As another example, the textual information 113 maybe suggested by an image assessment machine (e.g., based on an analysisof the image 111). The advertisement 110 also includes a price 115(e.g., “$100.00”) of the pants. Other advertisements appearing in thewebpage 100 may be structured in a manner similar to the advertisement110.

FIG. 2 is a diagram illustrating images 111 and 210-250, which are ofvarying quality, according to some example embodiments. The image 111may be a high-quality image (e.g., an image with high visual quality forpurposes of merchandising the item depicted therein), while the images210, 220, 230, 240, and 250 may be low-quality images (e.g., images withlow visual quality for purposes of merchandising the item depictedtherein).

As shown, the image 111 is an example of a high-quality image (e.g., animage that is relatively high in visual quality relative to the images210-250). For example, the image 111 may depict the item (e.g., thepants) in clear focus and with well-defined edges. The item (e.g., as aforeground portion of the image 111) may be shown prominently and takeup a large percentage or ratio of the size (e.g., total area) of theimage.

The image 111 may have one or more visual characteristics that arecorrelated with an effectiveness (e.g., as measured in actualclick-through counts or modeled projected click-through counts) formerchandising items. For example, these visual characteristics may becorrelated with high effectiveness (e.g., relative to other images) inmerchandising items. Furthermore, one or more of these visualcharacteristics may be quantifiable (e.g., computationally calculable)by an analysis of the image 111. Examples of such visual characteristicsinclude brightness (e.g., average brightness), contrast (e.g., averagecontrast), saturation (e.g., color intensity relative to brightness),colorfulness (e.g., distance away from gray within a color space),dynamic range (e.g., for brightness, contrast, or color), size (e.g.,total area, total width, or total height), aspect ratio (e.g., of widthto height), lightness (e.g., distance away from white within a colorspace), foreground uniformity (e.g., average foreground uniformity),background uniformity (e.g., average background uniformity), and theratio of foreground pixels (e.g., the number of pixels in the foregroundportion of the image 111) to background pixels (e.g., the number ofpixels in the background portion of the image 111).

The image 111 may include image data (e.g., digital data representing orencoding an array of pixels that, in aggregate, form the visual picturepresented by the image 111). Hence, one or more of the visualcharacteristics of the image 111 may be quantifiable by analysis of theimage data contained in the image 111. According to various exampleembodiments, the image 111 may be a data file (e.g., stored withinmemory or in a data repository) that includes the image data (e.g.,along with metadata describing the image 111).

As shown, the image 210 is an example of a low-quality image (e.g.,relative to the image 111). In the image 210, the item is depicted astoo small within the image 210 (e.g., with a low ratio of foregroundpixels to background pixels). The image 210 is also too bright andinsufficiently well-defined (e.g., low sharpness).

The image 220 is another example of a low-quality image (e.g., relativeto the image 111). As shown, the image 220 depicts the item with amediocre orientation (e.g., sideways for a pair of pants) within theimage 220. The image 220 is also too dark and has poor contrast betweenforeground and background portions of the image 220.

The image 230 is a further example of a low-quality image (e.g.,relative to the image 111). As shown, the image 230 depicts the itemwith a very poor orientation (e.g., a severely oblique view) and with acluttered and nonuniform background. In addition, the item is poorlycentered within the image 230 (e.g., placed in the lower right portionof the image 230).

The image 240 is yet another example of a low-quality image (e.g.,relative to the image 111). As shown, the image 240 has a very lowaspect ratio (e.g., of width to height) and depicts the item in adistorted (e.g., stretched thin) manner. Furthermore, the image 240 andthe item depicted therein may be too small to effectively merchandisethe item.

The image 250 is a yet further example of a low-quality image (e.g.,relative to the image 111). As shown, the image 250 does not depict thefull item and instead depicts only a portion of the item. Furthermore,the image 250 shows the portion of the item in an awkward orientation(e.g., nearly upside down when viewed on the webpage 100).

FIG. 3 is a network diagram illustrating a network environment 300suitable for image quality assessment to merchandise an item, accordingto some example embodiments. The network environment 300 includes animage assessment machine 310, a database 315, and devices 330 and 350,all communicatively coupled to each other via a network 390. In someexample embodiments, the device 330 and the device 350 are connected tothe image assessment machine 310 by a single network, while inalternative example embodiments, the devices 330 and 350 are connectedto the image assessment machine 310 by multiple networks. The imageassessment machine 310, the database 315, and the devices 330 and 350each may be implemented in a computer system, in whole or in part, asdescribed below with respect to FIG. 9.

The image assessment machine 310 is configured with one or more modulesdescribed below with respect to FIG. 4. As configured by the one or moremodules, the image assessment machine 310 may perform all or part of amethod described below with respect to FIG. 5-8. The image assessmentmachine 310 and the database 315 may form all or part of a network-basedcommerce system 305 (e.g., a network-based system that provides anonline shopping website).

Also shown in FIG. 3 are users 332 and 352. For example, the user 332may be a seller of the item (e.g., the pants) depicted in the image 111,and the user 352 may be a shopper for the item (e.g., a potential buyerof the item). One or both of the users 332 and 352 may be a human user(e.g., a human being), a machine user (e.g., a computer configured by asoftware program to interact with the device 330), or any suitablecombination thereof (e.g., a human assisted by a machine or a machinesupervised by a human). The user 332 is not part of the networkenvironment 300, but is associated with the device 330 and may be a userof the device 330. For example, the device 330 may be a tablet computeror a smart phone belonging to the user 332. Likewise, the user 352 isnot part of the network environment 300, but is associated with thedevice 350. As an example, the device 350 may be a tablet computer or asmart phone belonging to the user 352.

Any of the machines, databases, or devices shown in FIG. 3 may beimplemented in a general-purpose computer modified (e.g., configured orprogrammed) by software to be a special-purpose computer to perform thefunctions described herein for that machine. For example, a computersystem able to implement any one or more of the methodologies describedherein is discussed below with respect to FIG. 9. As used herein, a“database” is a data storage resource and may store data structured as atext file, a table, a spreadsheet, a relational database, a triplestore, or any suitable combination thereof. Moreover, any two or more ofthe machines illustrated in FIG. 3 may be combined into a singlemachine, and the functions described herein for any single machine maybe subdivided among multiple machines.

The network 390 may be any network that enables communication betweenmachines (e.g., image assessment machine 310 and the device 330).Accordingly, the network 390 may be a wired network, a wireless network(e.g., a mobile network), or any suitable combination thereof. Thenetwork 390 may include one or more portions that constitute a privatenetwork, a public network (e.g., the Internet), or any suitablecombination thereof.

FIG. 4 is a block diagram illustrating components of the imageassessment machine 310, according to some example embodiments. The imageassessment machine 310 includes an access module 410, an analysis module420, a score module 430, an action module 440, a segmentation module450, and the semantics module 460, all configured to communicate witheach other (e.g., via a bus, shared memory, or a switch). Any one ormore of the modules described herein may be implemented using hardware(e.g., a processor of a machine) or a combination of hardware andsoftware. Moreover, any two or more of these modules may be combinedinto a single module, and the functions described herein for a singlemodule may be subdivided among multiple modules.

The access module 410 is configured to access the image 111 that depictsan item (e.g., pants). The image 111 may be accessed from the database315 by the access module 410. As noted above, the image may have one ormore visual characteristics that are quantifiable by an analysis ofimage data included in the image. Further details of the access module410 are described below with respect to the example method illustratedby FIG. 5-8.

The analysis module 420 is configured to perform the analysis of theimage data included in the image 111. Further details of the analysismodule 420 are described below with respect to the example methodillustrated by FIG. 5-8.

The score module 430 is configured to determine a score of the image111, where the score determined by the score module 430 qualifies avisual characteristic (e.g., among multiple visual characteristics) ofthe image 111. Moreover, the determining of the score may be based onthe analysis performed by the analysis module 420 (e.g., the analysis ofthe image data included in the image 111).

The action module 440 is configured to perform an operation that ispertinent to merchandising the item depicted by the image 111 (e.g., anoperation that references the image 111, the textual information 113,the price 115, the item itself, or any suitable combination thereof).Moreover, the performing of the operations may be based on the scoredetermined by the score module 430, which may be based on the analysisperformed by the analysis module 420. Examples of the operation andfurther details of the action module 440 are described below withrespect to the example method illustrated by FIG. 5-8.

The segmentation module 450 is configured to segment the image 111 intoa foreground portion of the image 111 and a background portion of theimage 111. For example, the segmentation module 450 may use aGrabcut-based approach (e.g., algorithm) to determine one or more edgesof the foreground portion of the image 111 and distinguish it from thebackground portion of the image 111, resulting in segmentation of theimage 111. With the image 111 segmented by the segmentation module 450,the analysis module 420, the score module 430, or both, may operate onthe foreground portion separately from the background portion of theimage 111. In some example embodiments, the segmentation module 450 isconfigured to determine (e.g., detect) a shape of the item (e.g., pants)depicted in the image 111, where the shape of the item is the foregroundportion of the image 111. In other words, the foreground portion maydepict the item, while the background portion depicts no part of theitem. In addition, unless explicitly stated otherwise, all functionalityof the analysis module 420 and the score module 430 described withrespect to the entirety of the image 111 are similarly applicable to oneor more portions of the image 111 (e.g., applicable to the foregroundportion, to the background portion, or to both, separately or together).

The semantics module 460 is configured to perform semantic analysis onall or part of the image 111 (e.g., the foreground portion, thebackground portion, or both, separately or together). A semanticanalysis performed by the semantics module 460 may be based on a shapedetected in the image 111 (e.g., the shape of the item, as segmentedinto the foreground portion of the image 111), text detected in theimage 111 (e.g., using one or more OCR techniques), or any suitablecombination thereof. For example, the database 315 may store informationcorrelating a shape of an item (e.g., pants) with a textual descriptorof the shape (e.g., the word “pants”), and the semantics module 460 mayaccess the database 315 and access (e.g., read) the textual descriptorbased on the shape. Further details of the semantics module 460 aredescribed below with respect to the example method illustrated by FIG.5-8.

FIG. 5-8 are flowcharts illustrating operations of the image assessmentmachine 310 in performing a method 500 of image quality assessment tomerchandise an item, according to some example embodiments. Operationsin a method 500 may be performed by the image assessment machine 310,using modules described above with respect to FIG. 4. As shown in FIG.5, example embodiments of the method 500 include operations 510, 520,530, and 540.

In operation 510, the access module 410 accesses the image 111 thatdepicts an item (e.g., pants). As noted above, the image 111 has one ormore visual characteristics (e.g., a plurality of visualcharacteristics) that are quantifiable by an analysis of image data thatis included in the image 111. Examples of such visual characteristicsare discussed above with respect to FIG. 2.

In operation 520, the analysis module 420 performs the analysis of theimage data that is included in the image 111, which is accessed inoperation 510. According to various example embodiments, the analysismay include analyzing one or more of the visual characteristics of theimage 111, and in certain example embodiments, multiple visualcharacteristics are analyzed in operation 520. Further details ofoperation 520, according to various example embodiments, are discussedbelow with respect to FIG. 6.

In operation 530, the score module 430 determines a score (e.g., aquality score, a visual quality score, or a merchandising effectivenessscore) that quantifies a visual characteristic among the one or morevisual characteristics of the image 111. The determining of the scoremay be based on the analysis performed in operation 520. According tovarious example embodiments, multiple scores may be determined formultiple visual characteristics of the image 111.

In operation 540, the action module 440 performs an operation (e.g., anaction) that is pertinent to merchandising the item depicted by theimage 111 (e.g., an operation that makes reference to the image 111, thetextual information 113, the price 115, the item itself, or any suitablecombination thereof). Operation 540 may be performed based on the scoredetermined in operation 530. Further details of operation 540, accordingto various example embodiments, are discussed below with respect to FIG.6-8.

As shown in FIG. 6, the method 500 may include one or more of operations602, 610, 620, 621, 622, 623, 624, 625, 626, 627, 628, 629, 640, and641. In operation 602, the access module 410 receives the image 111. Theimage 111 may be received as a submission for inclusion in theadvertisement 110 of the item (e.g., pants) depicted in the image 111.Moreover, the image 111 may be received from the user 332 (e.g., via theuser device 330), who may be a seller of the item. According to someexample embodiments, operation 540 may be performed in response to thereception of the image 111 in operation 602.

One or more of operations 640 and 641 may be performed as part (e.g., aprecursor task, a subroutine, or a portion) of operation 540, in whichthe action module 440 performs an operation pertinent to themerchandising of the item depicted in the image 111. Various exampleembodiments of the method 500 include operation 640, operation 641, orboth.

In operation 640, the action module 440 provides a warning that theimage 111 is of low quality (e.g., “The image that you submitted is lowquality. Would you like to submit a different one?”). The warning may beprovided to the user 332 (e.g., as the seller of the item), for example,via the device 330.

In operation 641, the action module 440 provides a suggestion that theimage 111 be replaced with an alternative image (e.g., an image ofhigher quality than the image 111 received in operation 602). Forexample, the suggestion may state, “A better image of your item isavailable. Would you like to use it instead?” The suggestion may beprovided to the user 332 (e.g., as a seller of the item), for example,by the device 330.

One or more of operations 620-629 may be performed as part (e.g., aprecursor task, a subroutine, or a portion) of operation 520, in whichthe analysis module 420 performs the analysis of the image data includedin the image 111. According to various example embodiments, all or partof operations 620-629 may be included in the method 500, in any suitablecombination.

A visual characteristic analyzed in operation 520 may be a brightness(e.g., brightness level) of the image 111. In operation 620, theanalysis module 420 analyzes the brightness of the image 111.Accordingly, the score determined in operation 530 may quantify thebrightness of the image 111, and operation 540 may be performed based onthe brightness of the image 111.

A visual characteristic analyzed in operation 520 may be a contrast(e.g., contrast level) of the image 111. In operation 621, the analysismodule 420 analyzes the contrast of the image 111. Accordingly, thescore determined in operation 530 may quantify the contrast of the image111, and operation 540 may be performed based on the contrast of theimage 111.

A visual characteristic analyzed in operation 520 may be a saturation(e.g., color intensity level) of the image 111. In operation 622, theanalysis module 420 analyzes the saturation of the image 111.Accordingly, the score determined in operation 530 may quantify thesaturation of the image 111, and operation 540 may be performed based onthe saturation of the image 111.

A visual characteristic analyzed in operation 520 may be a colorfulnessof the image 111. In operation 623, the analysis module 420 analyzes thecolorfulness of the image 111. Accordingly, the score determined inoperation 530 may quantify the colorfulness of the image 111, andoperation 540 may be performed based on the colorfulness of the image111.

A visual characteristic analyzed in operation 520 may be a dynamic range(e.g., of brightness, of contrast, or of a particular color) of theimage 111. In operation 624, the analysis module 420 analyzes thedynamic range of the image 111. Accordingly, the score determined inoperation 530 may quantify the dynamic range of the image 111, andoperation 540 may be performed based on the dynamic range of the image111.

A visual characteristic analyzed in operation 520 may be a size (e.g.,total area, total width, or total height) of the image 111. In operation625, the analysis module 420 analyzes the size of the image 111.Accordingly, the score determined in operation 530 may quantify the sizeof the image 111, and operation 540 may be performed based on the sizeof the image 111.

A visual characteristic analyzed in operation 520 may be an aspect ratio(e.g., of width to height) of the image 111. In operation 626, theanalysis module 420 analyzes the aspect ratio of the image 111.Accordingly, the score determined in operation 530 may quantify theaspect ratio of the image 111, and operation 540 may be performed basedon the aspect ratio of the image 111.

Example embodiments that include one or more of operations 627-629 mayalso include operation 610. In operation 610, the segmentation module450 segments the image 111 into a foreground portion of the image 111and a background portion of the image 111. As noted above, theforeground portion depicts the item (e.g., pants), and the backgroundportion depicts no part of the item.

A visual characteristic analyzed in operation 520 may be a foregrounduniformity (e.g., a level of smoothness of the foreground portion) ofthe image 111. In operation 627, the analysis module 420 analyzes theforeground uniformity of the image 111. Accordingly, the scoredetermined in operation 530 may quantify the foreground uniformity ofthe image 111, and operation 540 may be performed based on theforeground uniformity of the image 111.

A visual characteristic analyzed in operation 520 may be a backgrounduniformity (e.g., a level of smoothness of the background portion) ofthe image 111. In operation 628, the analysis module 420 analyzes thebackground uniformity of the image 111. Accordingly, the scoredetermined in operation 530 may quantify the background uniformity ofthe image 111, and operation 540 may be performed based on thebackground uniformity of the image 111.

A visual characteristic analyzed in operation 520 may be a ratio of anumber of foreground pixels from the foreground portion to the number ofbackground pixels from the background portion of the image 111. Inoperation 629, the analysis module 420 analyzes the ratio of foregroundpixels to background pixels. Accordingly, the score determined inoperation 530 may quantify the ratio of foreground pixels to backgroundpixels of the image 111, and operation 540 may be performed based on theratio of foreground pixels to background pixels of the image 111.

As shown in FIG. 7, the method 500 may include one or more of operations710, 720, 721, 722, 723, 724, 725, 741, 742, 743, 744, 745, and 746.Operation 710 may expand upon operation 530, in which the score module430 determines a score that quantifies a visual characteristic of theimage 111. In operation 710, multiple scores (e.g., a plurality ofscores) are determined by the score module 430, and these multiplescores may quantify multiple visual characteristics of the image 111.Moreover, the multiple scores may be determined as one or more featurevectors (e.g., multiple feature vectors or multiple components of asingle multi-dimensional feature vector) that quantify the visualcharacteristics of the image 111 (e.g., in their entirety). Accordingly,a sophisticated profile of the image 111 may be represented by thescores determined in operation 710.

In operation 720, the score module 430 determines an overall score thatrepresents an overall visual quality of the image 111, based on thescores determined in operation 710. For example, the score module 430may compute the overall score as a weighted average of individual scorescorresponding to individual visual characteristics of the image 111. Asanother example, the score module 430 may compute the overall score as apredominant feature vector (e.g., normalized within a multidimensionalfeature vector space). Accordingly, a sophisticated profile of the image111 may be summarized by a single overall score determined in operation720.

In example embodiments that include operation 720, the action module 440may perform operation 540 based on the overall score determined inoperation 720. This is shown as operation 741 in FIG. 7.

One or more of operations 721, 722, 723, 724, and 725 may be performedas part (e.g., a precursor task, a subroutine, or a portion) ofoperation 720, in which the score module 430 determines the overallscore. In a corresponding manner, one or more of operations 742, 743,744, 745, and 746 may be performed as part of operation 540.

In operation 721, the score module 430 determines an overall score thatrepresents an average brightness of the image 111. Accordingly, inoperation 742, the performing of the operation by the action module 440may be based on the average brightness of the image determined 111 inoperation 721.

In operation 722, the score module 430 determines an overall score thatrepresents an average contrast of the image 111. Accordingly, inoperation 743, the performing of the operation by the action module 440may be based on the average contrast of the image 111 determined inoperation 722.

In operation 723, the score module 430 determines an overall score thatrepresents an average lightness of the image 111. Accordingly, inoperation 744, the performing of the operation by the action module 440may be based on the average lightness of the image 111 determined inoperation 723.

In operation 724, the score module 430 determines an overall score thatrepresents an average lightness of a background portion of the image 111(e.g., segmented in operation 610). Accordingly, in operation 745, theperforming of the operation by the action module 440 may be based on theaverage lightness of the background portion of the image 111 determinedin operation 724.

In operation 725, the score module 430 determines an overall score thatrepresents an average uniformity of a background portion of the image111 (e.g., segmented in operation 610). Accordingly, in operation 746,the performing of the operation by the action module 440 may be based onthe average uniformity of the background portion of the image 111determined in operation 725.

As shown in FIG. 8, the method 500 may include one or more of operations801, 811, 821, 822, 823, 830, 832, 841, 842, 843, 844, 845, and 846. Insome example embodiments, operations 801, 811, 830, and 841 areperformed as part of the method 500.

In operation 801, the access module 410 receives textual information(e.g., textual information 113) as a submission for a description of theitem. For example, the textual information may be received from the user332 (e.g., as a seller of the item) via the device 330. The textualinformation received in operation 801 may be described as“seller-submitted textual information” or “seller's textual information”and may describe the item accurately, partially, poorly, or not at all.

Operation 811 may be performed as part (e.g., a precursor task, asubroutine, or a portion) of operation 610, in which the segmentationmodule 450 segments the image 111 into a foreground portion and abackground portion. In operation 811, the segmentation module 450determines the shape of the item (e.g., pants) detected in the image111. For example, the shape of the item may be determined by segmentingthe image 111 as discussed above with respect to operation 610 (e.g.,determining of the outline of the pants).

In operation 830, the semantics module 460 accesses the database 315. Insuch example embodiments, the database 315 correlates the shape of theitem (e.g., as determined in operation 811) with textual information(e.g., textual information 113) that describes the item. The textualinformation accessed in operation 830 may be described as“database-accessed textual information” or database's textualinformation” and may coincide, semantically or verbatim, with theseller-submitted textual information received in operation 801. In somesituations, the database-accessed textual information is different(e.g., semantically different) from the seller-submitted textualinformation.

In example embodiments that include operation 830, the action module 440may perform operation 841 as part (e.g., a precursor task, a subroutine,or a portion) of operation 540. In operation 841, the action module 440provides the database-accessed textual information that was accessed inoperation 830. The action module 440 may provide the database-accessedtextual information based on the shape of the item (e.g., based on theshape of the item being correlated with the database-accessed textualinformation). For example, the database-accessed textual information maybe provided to the user 332 (e.g., as the seller of the item) via thedevice 330.

According to some example embodiments, the database-accessed textualinformation (e.g., textual information 113, “pants”) is more descriptiveof the item depicted in the image 111 than the seller-submitted textualinformation (e.g., “duds I'm selling”). Such example embodiments mayinclude one or more of operations 842 and 843, as part (e.g., aprecursor task, a subroutine, or a portion) of operation 540.

In operation 842, the action module 440 provides a warning that theseller-submitted textual information (e.g., received in operation 801)does not describe the item depicted in the image 111. In some exampleembodiments, the warning states that the seller-submitted textualinformation does not describe the item as well as other textualinformation (e.g., available from the database 315). The warning may beprovided to the user 332 (e.g., as the seller of the item) via thedevice 330. As an example, the seller-submitted textual informationreceived in operation 801 may be the word “hat,” and the warning maystate, “That does not look like a ‘hat’!”

In operation 843, the action module 440 provides a suggestion that theitem depicted in the image 111 be described by the database-accessedtextual information (e.g., accessed in operation 830). In some exampleembodiments, the suggestion proposes that the database-accessed textualinformation be used instead of the seller-submitted textual information.The suggestion may be provided to the user 332 (e.g., as the seller ofthe item) via the device 330. As an example, the database-accessedtextual information may be the word “pants,” and the suggestion maystate, “That looks more like ‘pants,’ instead of a ‘hat.’ Would you liketo update the description?”

According to various example embodiments, the image assessment machine310 may monitor the effectiveness of one or more images (e.g., image111) with respect to merchandising items depicted therein. As notedabove, a click-through rate may form all or part of a measurement ofeffectiveness.

In operation 832, the score module 430 determines a click-through rateof the advertisement 110 that includes the image 111. In some exampleembodiments, the score module 430 calculates the click-through rate(e.g., from web server logs obtained from the network-based commercesystem 305). In other example embodiments, the score module 430 accessesor receives the click-through rate (e.g., from a web server of thenetwork-based commerce system 305).

In operation 844, the action module 440 (e.g., in performing operation540) determines a rank of the image 111 (e.g., among other imagesincluded in other advertisements). The determination of the rank of theimage 111 may be based on the click-through rate determined in operation832, the score determined in operation 530, any of the multiple scoresdetermined in operation 710, the overall score determined in operation720, or any suitable combination thereof.

In operation 845, the action module 440 (e.g., in performing operation540) provides the image 111 within the search result. For example, theaction module 440 may provide a set of search results (e.g., retrievedfrom a search engine or from the database 315), where each search resultis an advertisement, and where one of the advertisements is theadvertisement 110 with the image 111. Moreover, the search result may beprovided according to the ranks determined in operation 844.Accordingly, the click-through rate determined in operation 832, thescore determined in operation 530, any of the multiple scores determinedin operation 710, the overall score determined in operation 720, or anysuitable combination thereof, may constitute one or more factors thatinfluence the presentation of search results that have images for itemsreferenced to therein. Hence, the visual characteristics of images maybe analyzed to raise or lower the rankings of the images (e.g., forpurposes of merchandising items depicted therein).

According to certain example embodiments, the image assessment machine310 may detect images that depict the same item, despite the imageshaving significantly distinct visual characteristics. Such significantlydistinct visual characteristics may be due to differences in lightingconditions under which the images were captured (e.g., taken by theirrespective cameras). Accordingly, certain example embodiments of themethod 500 include operations 821, 822, 823, and 846.

In operation 821, the analysis module 420 generates a histogram of theimage 111 (e.g., based on the image data included in the image 111). Theimage 111 may depict the item (e.g., pants) under a particular lightingcondition (e.g., a first lighting condition). The histogram generated inoperation 821 may represent or depict energy levels of the image 111(e.g., entity levels encoded by the image data included in the image111). An energy level may also be referred to as an “energy value.”

In operation 822, the analysis module 420 partitions the energy levelsof the image 111 in two multiple partitions of a histogram (e.g.,multiple sections or bands within the histogram). These multiplepartitions may be known as “bins.”

In some example embodiments, the energy levels within the multiplepartitions of the histogram may be blurred or spread between adjoiningpartitions (e.g., neighboring bins). For example, the energy levels maybe processed in a manner that determines or modifies the energy level ofone partition based on the energy level of one or more neighboringpartitions. In operation 823, the analysis module 420 performs suchprocessing. In some example embodiments, the analysis module 420determines an energy level (e.g., a first energy level) of a partitionamong the multiple partitions based on another energy level (e.g., asecond energy level) in an adjacent partition among the multiplepartitions. Accordingly, some example embodiments of the analysis module420 may normalize one or more energy levels in the histogram relative toneighboring energy levels.

Based on the processing of the histogram in operations 821 and 822, theanalysis module 420 may determine (e.g., detect) that the item shownunder a particular lighting condition in the image 111 is also shownunder a different lighting condition (e.g., a second lighting condition)in another image. In response to this determination, the action module440, in operation 846, may provide an indication that the item depictedin the image 111 is also depicted in the other image (e.g., a furtherimage). For example, the indication may state, “Here is another pictureof the same item. Do you wish to use it instead?” In various exampleembodiments, such an indication may be provided as all or part ofoperation 540 (e.g., as all or part of operation 640 or operation 641).

According to various example embodiments, one or more of themethodologies described herein may facilitate image quality assessmentto merchandise one or more items. Moreover, one or more of themethodologies described herein may facilitate identification of imagesthat depict the same item, despite showing the item under differentlighting conditions. Hence, one or more the methodologies describedherein may facilitate presentation of advertisements or similarinformation regarding items depicted by images.

When these effects are considered in aggregate, one or more of themethodologies described herein may obviate a need for certain efforts orresources that otherwise would be involved in assessing image quality,merchandising items, identifying redundant images, or any suitablecombination thereof. Efforts expended by a user in performing such tasksmay be reduced by one or more of the methodologies described herein.Computing resources used by one or more machines, databases, or devices(e.g., within the network environment 300) may similarly be reduced.Examples of such computing resources include processor cycles, networktraffic, memory usage, data storage capacity, power consumption, andcooling capacity.

FIG. 9 is a block diagram illustrating components of a machine 900,according to some example embodiments, able to read instructions from amachine-readable medium (e.g., a machine-readable storage medium) andperform any one or more of the methodologies discussed herein.Specifically, FIG. 9 shows a diagrammatic representation of the machine900 in the example form of a computer system and within whichinstructions 924 (e.g., software) for causing the machine 900 to performany one or more of the methodologies discussed herein may be executed.In alternative embodiments, the machine 900 operates as a standalonedevice or may be connected (e.g., networked) to other machines. In anetworked deployment, the machine 900 may operate in the capacity of aserver machine or a client machine in a server-client networkenvironment, or as a peer machine in a peer-to-peer (or distributed)network environment. The machine 900 may be a server computer, a clientcomputer, a personal computer (PC), a tablet computer, a laptopcomputer, a netbook, a set-top box (STB), a personal digital assistant(PDA), a cellular telephone, a smartphone, a web appliance, a networkrouter, a network switch, a network bridge, or any machine capable ofexecuting the instructions 924, sequentially or otherwise, that specifyactions to be taken by that machine. Further, while only a singlemachine is illustrated, the term “machine” shall also be taken toinclude a collection of machines that individually or jointly executethe instructions 924 to perform any one or more of the methodologiesdiscussed herein.

The machine 900 includes a processor 902 (e.g., a central processingunit (CPU), a graphics processing unit (GPU), a digital signal processor(DSP), an application specific integrated circuit (ASIC), aradio-frequency integrated circuit (RFIC), or any suitable combinationthereof), a main memory 904, and a static memory 906, which areconfigured to communicate with each other via a bus 908. The machine 900may further include a graphics display 910 (e.g., a plasma display panel(PDP), a light emitting diode (LED) display, a liquid crystal display(LCD), a projector, or a cathode ray tube (CRT)). The machine 900 mayalso include an alphanumeric input device 912 (e.g., a keyboard), acursor control device 914 (e.g., a mouse, a touchpad, a trackball, ajoystick, a motion sensor, or other pointing instrument), a storage unit916, a signal generation device 918 (e.g., a speaker), and a networkinterface device 920.

The storage unit 916 includes a machine-readable medium 922 on which isstored the instructions 924 (e.g., software) embodying any one or moreof the methodologies or functions described herein. The instructions 924may also reside, completely or at least partially, within the mainmemory 904, within the processor 902 (e.g., within the processor's cachememory), or both, during execution thereof by the machine 900.Accordingly, the main memory 904 and the processor 902 may be consideredas machine-readable media. The instructions 924 may be transmitted orreceived over a network 926 (e.g., network 390) via the networkinterface device 920.

As used herein, the term “memory” refers to a machine-readable mediumable to store data temporarily or permanently and may be taken toinclude, but not be limited to, random-access memory (RAM), read-onlymemory (ROM), buffer memory, flash memory, and cache memory. While themachine-readable medium 922 is shown in an example embodiment to be asingle medium, the term “machine-readable medium” should be taken toinclude a single medium or multiple media (e.g., a centralized ordistributed database, or associated caches and servers) able to storeinstructions. The term “machine-readable medium” shall also be taken toinclude any medium that is capable of storing instructions (e.g.,software) for execution by a machine (e.g., machine 900), such that theinstructions, when executed by one or more processors of the machine(e.g., processor 902), cause the machine to perform any one or more ofthe methodologies described herein. The term “machine-readable medium”shall accordingly be taken to include, but not be limited to, a datarepository in the form of a solid-state memory, an optical medium, amagnetic medium, or any suitable combination thereof.

Throughout this specification, plural instances may implementcomponents, operations, or structures described as a single instance.Although individual operations of one or more methods are illustratedand described as separate operations, one or more of the individualoperations may be performed concurrently, and nothing requires that theoperations be performed in the order illustrated. Structures andfunctionality presented as separate components in example configurationsmay be implemented as a combined structure or component. Similarly,structures and functionality presented as a single component may beimplemented as separate components. These and other variations,modifications, additions, and improvements fall within the scope of thesubject matter herein.

Certain embodiments are described herein as including logic or a numberof components, modules, or mechanisms. Modules may constitute eithersoftware modules (e.g., code embodied on a machine-readable medium or ina transmission signal) or hardware modules. A “hardware module” is atangible unit capable of performing certain operations and may beconfigured or arranged in a certain physical manner. In various exampleembodiments, one or more computer systems (e.g., a standalone computersystem, a client computer system, or a server computer system) or one ormore hardware modules of a computer system (e.g., a processor or a groupof processors) may be configured by software (e.g., an application orapplication portion) as a hardware module that operates to performcertain operations as described herein.

In some embodiments, a hardware module may be implemented mechanically,electronically, or any suitable combination thereof. For example, ahardware module may include dedicated circuitry or logic that ispermanently configured to perform certain operations. For example, ahardware module may be a special-purpose processor, such as a fieldprogrammable gate array (FPGA) or an ASIC. A hardware module may alsoinclude programmable logic or circuitry that is temporarily configuredby software to perform certain operations. For example, a hardwaremodule may include software encompassed within a general-purposeprocessor or other programmable processor. It will be appreciated thatthe decision to implement a hardware module mechanically, in dedicatedand permanently configured circuitry, or in temporarily configuredcircuitry (e.g., configured by software) may be driven by cost and timeconsiderations.

Accordingly, the phrase “hardware module” should be understood toencompass a tangible entity, be that an entity that is physicallyconstructed, permanently configured (e.g., hardwired), or temporarilyconfigured (e.g., programmed) to operate in a certain manner or toperform certain operations described herein. As used herein,“hardware-implemented module” refers to a hardware module. Consideringembodiments in which hardware modules are temporarily configured (e.g.,programmed), each of the hardware modules need not be configured orinstantiated at any one instance in time. For example, where a hardwaremodule comprises a general-purpose processor configured by software tobecome a special-purpose processor, the general-purpose processor may beconfigured as respectively different special-purpose processors (e.g.,comprising different hardware modules) at different times. Software mayaccordingly configure a processor, for example, to constitute aparticular hardware module at one instance of time and to constitute adifferent hardware module at a different instance of time.

Hardware modules can provide information to, and receive informationfrom, other hardware modules. Accordingly, the described hardwaremodules may be regarded as being communicatively coupled. Where multiplehardware modules exist contemporaneously, communications may be achievedthrough signal transmission (e.g., over appropriate circuits and buses)between or among two or more of the hardware modules. In embodiments inwhich multiple hardware modules are configured or instantiated atdifferent times, communications between such hardware modules may beachieved, for example, through the storage and retrieval of informationin memory structures to which the multiple hardware modules have access.For example, one hardware module may perform an operation and store theoutput of that operation in a memory device to which it iscommunicatively coupled. A further hardware module may then, at a latertime, access the memory device to retrieve and process the storedoutput. Hardware modules may also initiate communications with input oroutput devices, and can operate on a resource (e.g., a collection ofinformation).

The various operations of example methods described herein may beperformed, at least partially, by one or more processors that aretemporarily configured (e.g., by software) or permanently configured toperform the relevant operations. Whether temporarily or permanentlyconfigured, such processors may constitute processor-implemented modulesthat operate to perform one or more operations or functions describedherein. As used herein, “processor-implemented module” refers to ahardware module implemented using one or more processors.

Similarly, the methods described herein may be at least partiallyprocessor-implemented, a processor being an example of hardware. Forexample, at least some of the operations of a method may be performed byone or more processors or processor-implemented modules. Moreover, theone or more processors may also operate to support performance of therelevant operations in a “cloud computing” environment or as a “softwareas a service” (SaaS). For example, at least some of the operations maybe performed by a group of computers (as examples of machines includingprocessors), with these operations being accessible via a network (e.g.,the Internet) and via one or more appropriate interfaces (e.g., anapplication program interface (API)).

The performance of certain of the operations may be distributed amongthe one or more processors, not only residing within a single machine,but deployed across a number of machines. In some example embodiments,the one or more processors or processor-implemented modules may belocated in a single geographic location (e.g., within a homeenvironment, an office environment, or a server farm). In other exampleembodiments, the one or more processors or processor-implemented modulesmay be distributed across a number of geographic locations.

Some portions of this specification are presented in terms of algorithmsor symbolic representations of operations on data stored as bits orbinary digital signals within a machine memory (e.g., a computermemory). These algorithms or symbolic representations are examples oftechniques used by those of ordinary skill in the data processing artsto convey the substance of their work to others skilled in the art. Asused herein, an “algorithm” is a self-consistent sequence of operationsor similar processing leading to a desired result. In this context,algorithms and operations involve physical manipulation of physicalquantities. Typically, but not necessarily, such quantities may take theform of electrical, magnetic, or optical signals capable of beingstored, accessed, transferred, combined, compared, or otherwisemanipulated by a machine. It is convenient at times, principally forreasons of common usage, to refer to such signals using words such as“data,” “content,” “bits,” “values,” “elements,” “symbols,”“characters,” “terms,” “numbers,” “numerals,” or the like. These words,however, are merely convenient labels and are to be associated withappropriate physical quantities.

Unless specifically stated otherwise, discussions herein using wordssuch as “processing,” “computing,” “calculating,” “determining,”“presenting,” “displaying,” or the like may refer to actions orprocesses of a machine (e.g., a computer) that manipulates or transformsdata represented as physical (e.g., electronic, magnetic, or optical)quantities within one or more memories (e.g., volatile memory,non-volatile memory, or any suitable combination thereof), registers, orother machine components that receive, store, transmit, or displayinformation. Furthermore, unless specifically stated otherwise, theterms “a” or “an” are herein used, as is common in patent documents, toinclude one or more than one instance. Finally, as used herein, theconjunction “or” refers to a non-exclusive “or,” unless specificallystated otherwise.

The following descriptions define various example embodiments of methodsand systems (e.g., apparatus) discussed herein:

1. A method comprising:

-   -   accessing an image that depicts an item,        -   the image having a plurality of visual characteristics that            are quantifiable by an analysis of image data included in            the image;    -   performing the analysis of the image data included in the image        that depicts the item and that has the plurality of visual        characteristics;    -   determining a score that quantifies a visual characteristic        among the plurality of visual characteristics of the image that        depicts the item,        -   the determining of the score being based on the analysis of            the image data included in the image that depicts the item,        -   the determining of the score being performed by a processor            of a machine; and    -   performing an operation that makes reference to the image,        -   the operation being performed based on the score that            quantifies the visual characteristic among the plurality of            visual characteristics of the image that depicts the item.

2. The method of description 1 further comprising:

-   -   receiving the image as a submission from a seller of the item        for inclusion in an advertisement of the item; and wherein    -   the performing of the operation includes providing a warning to        the seller that the image is of low quality.

3. The method of description 1 or description 2 further comprising:

-   -   receiving the image as a submission from a seller of the item        for inclusion in an advertisement of the item; and wherein    -   the performing of the operation includes providing a suggestion        to the seller that the received image be replaced with an        alternative image of higher quality.

4. The method of any of descriptions 1-3, wherein:

-   -   the visual characteristic is a brightness of the image; and    -   the performing of the analysis of the image data includes        analyzing the brightness of the image;    -   the score quantifies the brightness of the image; and    -   the performing of the operation is based on the brightness of        the image.

5. The method of any of descriptions 1-4, wherein:

-   -   the visual characteristic is a contrast of the image;    -   the performing of the analysis of the image data includes        analyzing the contrast of the image;    -   the score quantifies the contrast of the image; and    -   the performing of the operation is based on the contrast of the        image.

6. The method of any of descriptions 1-5, wherein:

-   -   the visual characteristic is a saturation of the image;    -   the performing of the analysis of the image data includes        analyzing the saturation of the image;    -   the score quantifies the saturation of the image; and    -   the performing of the operation is based on the saturation of        the image.

7. The method of any of descriptions 1-6, wherein:

-   -   the visual characteristic is a colorfulness of the image;    -   the performing of the analysis of the image data includes        analyzing the colorfulness of the image;    -   the score quantifies the colorfulness of the image; and    -   the performing of the operation is based on the colorfulness of        the image.

8. The method of any of descriptions 1-7, wherein:

-   -   the visual characteristic is a dynamic range of the image;    -   the performing of the analysis of the image data includes        analyzing the dynamic range of the image;    -   the score quantifies the dynamic range of the image; and    -   the performing of the operation is based on the dynamic range of        the image.

9. The method of any of descriptions 1-8, wherein:

-   -   the visual characteristic is a size of the image;    -   the performing of the analysis of the image data includes        analyzing the size of the image;    -   the score quantifies the size of the image; and    -   the performing of the operation is based on the size of the        image.

10. The method of any of descriptions 1-9, wherein:

-   -   the visual characteristic is an aspect ratio of the image;    -   the performing of the analysis of the image data includes        analyzing the aspect ratio of the image;    -   the score quantifies the aspect ratio of the image; and    -   the performing of the operation is based on the aspect ratio of        the image.

11. The method of any of descriptions 1-10 further comprising:

-   -   segmenting the image into a foreground portion of the image that        depicts the item and a background portion of the image that        depicts no part of the item; wherein    -   the visual characteristic is a uniformity of the foreground        portion of the image;    -   the performing of the analysis of the image data includes        analyzing the uniformity of the foreground portion of the image;    -   the score quantifies the uniformity of the foreground portion of        the image; and    -   the performing of the operation is based on the uniformity of        the foreground portion of the image.

12. The method of any of descriptions 1-11 further comprising:

-   -   segmenting the image into a foreground portion of the image that        depicts the item and a background portion of the image that        depicts no part of the item; wherein    -   the visual characteristic is a uniformity of the background        portion of the image;    -   the performing of the analysis of the image data includes        analyzing the uniformity of the background portion of the image;    -   the score quantifies the uniformity of the background portion of        the image; and    -   the performing of the operation is based on the uniformity of        the background portion of the image.

13. The method of any of descriptions 1-12 further comprising:

-   -   segmenting the image into a foreground portion of the image that        depicts the item and a background portion of the image that        depicts no part of the item; wherein    -   the visual characteristic is a ratio of a number of foreground        pixels from the foreground portion to a number of background        pixels from the background portion;    -   the performing of the analysis of the image data includes        analyzing the ratio of the number of foreground pixels to the        number of background pixels;    -   the score quantifies the ratio of the number of foreground        pixels to the number of background pixels; and    -   the performing of the operation is based on the ratio of the        number of foreground pixels to the number of background pixels.

14. The method of any of descriptions 1-13 further comprising:

-   -   determining a plurality of scores as feature vectors that        quantify the plurality of visual characteristics of the image;        and    -   determining an overall score representative of an overall visual        quality of the image based on the plurality of scores; wherein    -   the performing of the operation is based on the overall score        that represents the overall visual quality of the image that        depicts the item.

15. The method of description 14, wherein:

-   -   the overall score represents an average brightness of the image;        and    -   the performing of the operation is based on the average        brightness of the image.

16. The method of description 14 or description 15, wherein:

-   -   the overall score represents an average contrast to the image;        and    -   the performing of the operation is based on the average contrast        to the image.

17. The method of any of descriptions 14-16, wherein:

-   -   the overall score represents an average lightness of the image,    -   the average lightness indicating an average distance from white        in a color space; and    -   the performing of the operation is based on the average        lightness of the image.

18. The method of any of descriptions 14-17 further comprising:

-   -   segmenting the image into a foreground portion of the image that        depicts the item and a background portion of the image that        depicts no part of the item; wherein    -   the overall score represents an average lightness of the        background portion of the image,    -   the average lightness indicating an average distance from white        in a color space; and    -   the performing of the operation is based on the average        lightness of the background portion of the image.

19. The method of any of descriptions 14-18 further comprising:

-   -   segmenting the image into a foreground portion of the image that        depicts the item and a background portion of the image that        depicts no part of the item; wherein    -   the overall score represents an average uniformity of the        background portion of the image; and    -   the performing of the operation is based on the average        uniformity of the background portion of the image.

20. The method of any of descriptions 1-19 further comprising:

-   -   determining a shape of the item by segmenting the image;    -   accessing a database that correlates the shape of the item with        textual information that describes the item; and wherein    -   the performing of the operation includes providing the textual        information based on the shape of the item being correlated with        the textual information.

21. The method of description 20 further comprising:

-   -   receiving different textual information as a submission for a        description of the item from a seller of the item; and wherein    -   the performing of the operation includes providing a warning to        the seller that the different textual information does not        describe the item.

22. The method of description 20 or description 21 further comprising:

-   -   receiving different textual information as a submission for a        description of the item from a seller of the item; and wherein    -   the performing of the operation includes providing a suggestion        to the seller that the item be described by the textual        information accessed from the database.

23. The method of any of descriptions 1-22, wherein:

-   -   the performing of the operation includes determining a rank of        the image of the item and providing the image of the item within        a search result,    -   the search result being provided according to the determined        rank among a plurality of search results.

24. The method of description 23 further comprising:

-   -   determining a click-through rate of an advertisement that        includes the image; and wherein    -   the determining of the rank of the image is based on the        click-through rate of the advertisement.

25. The method of any of descriptions 1-24, wherein:

-   -   the image depicts the item under a first lighting condition;    -   the performing of the analysis of the image data includes:        -   generating a histogram of the image,        -   partitioning energy values from the histogram of the image            into multiple partitions of the histogram, and        -   determining a first energy value of a partition among the            multiple partitions based on a second energy value of an            adjacent partition among the multiple partitions; and    -   the performing of the operation includes providing an indication        that the item depicted in the image is also depicted in a        further image that depicts the item under a second lighting        condition.

26. A non-transitory machine-readable storage medium comprisinginstructions that, when executed by one or more processors of a machine,cause the machine to perform operations comprising:

-   -   accessing an image that depicts an item,        -   the image having a plurality of visual characteristics that            are quantifiable by an analysis of image data included in            the image;    -   performing the analysis of the image data included in the image        that depicts the item and that has the plurality of visual        characteristics;    -   determining a score that quantifies a visual characteristic        among the plurality of visual characteristics of the image that        depicts the item,        -   the determining of the score being based on the analysis of            the image data included in the image that depicts the item,        -   the determining of the score being performed by the one or            more processors of the machine; and    -   performing an operation that makes reference to the image,    -   the operation being performed based on the score that quantifies        the visual characteristic among the plurality of visual        characteristics of the image that depicts the item.

27. The non-transitory machine-readable storage medium of description26, wherein the operations further comprise:

-   -   receiving the image as a submission from a seller of the item        for inclusion in an advertisement of the item; and wherein    -   the performing of the operation includes providing a warning to        the seller that the image is of low quality.

28. A system comprising:

-   -   an access module configured to access an image that depicts an        item,        -   the image having a plurality of visual characteristics that            are quantifiable by an analysis of image data included in            the image;    -   an analysis module configured to perform the analysis of the        image data included in the image that depicts the item and that        has the plurality of visual characteristics;    -   a processor configured by a score module that configures the        processor to determine a score that quantifies a visual        characteristic among the plurality of visual characteristics of        the image that depicts the item,    -   the determining of the score being based on the analysis of the        image data included in the image that depicts the item,        -   the determining of the score being performed by a processor            of a machine; and    -   an action module configured to perform an operation that makes        reference to the image,        -   the operation being performed based on the score that            quantifies the visual characteristic among the plurality of            visual characteristics of the image that depicts the item.

29. The system of description 28, wherein:

-   -   the access module is configured to receive the image as a        submission from a seller of the item for inclusion in an        advertisement of the item; and wherein    -   the operation module is configured to provide a warning to the        seller that the image is of low quality.

What is claimed is:
 1. A system comprising: a processor; and a memorystoring instructions that, when executed by the processor, causes theprocessor to perform operations comprising: receiving item listing datathat includes at least image data and textual data, wherein the imagedata comprises an image that includes a plurality of visualcharacteristics, and the textual data comprises a textual input to beapplied to the image; identifying an item depicted by the image based onthe plurality of visual characteristics; performing a comparison of theitem depicted by the image and the textual input to be applied to theimage; determining, based on the comparison of the item depicted by theimage and the textual input, that the item depicted by the image doesnot correlate with the textual input; and causing display of anotification at the client device in response to the determining thatthe item depicted by the image does not correlate with the textualinput, the notification comprising a presentation of a request for analternate image, the presentation of the request comprising a display ofa text string that includes the textual input.
 2. The system of claim 1;wherein the identifying the item depicted by the image based on theplurality of visual characteristics includes: detecting one or moreedges of the item based on the plurality of visual characteristics; anddetecting the item based on the one or more edges of the item.
 3. Thesystem of claim 1, wherein: the visual characteristic is a brightness ofthe foreground portion of the image, and wherein the set of scoresincludes a score of the brightness of the foreground portion of theimage.
 4. The system of claim 1, wherein: the visual characteristic is acontrast level of the foreground portion of the image, and wherein theset of scores includes a score of the contrast level of the foregroundportion of the image.
 5. The system of claim 1, wherein: the visualcharacteristic is a saturation of the first foreground portion of theimage, and wherein the set of scores includes a score of the saturationof the foreground portion of the image.
 6. The system of claim 1,wherein: the visual characteristic is an aspect ratio of the foregroundportion of the image, and wherein the set of scores includes a score ofthe aspect ratio of the foreground portion of the image.
 7. The systemof claim 1, wherein: the visual characteristic is a uniformity of theforeground portion of the image; and wherein the set of scores includesa score of the uniformity of the foreground portion of the image.
 8. Thesystem of claim 1, wherein: the visual characteristic is a uniformity ofthe background portion of the image, and wherein the set of scoresincludes a score of the uniformity of the background portion of theimage; and wherein the calculating the overall score of the image isbased on the set of scores of the foreground portion of the image andthe background portion of the image.
 9. The system of claim 1, whereinthe visual characteristic is a ratio of a number of foreground pixelsfrom the foreground portion of the image to a number of backgroundpixels from the background portion of the image; and wherein thecalculating the set of scores includes calculating a score based on theratio.
 10. The system of claim 1, wherein: the visual characteristic isan average lightness of the image, and wherein the set of scoresincludes a score of the average lightness of the image.
 11. The systemof claim 1, wherein: the image is a first image that depicts an itemunder a first lighting condition; determining that the item depictedunder the first lighting condition is also depicted in a second imagethat depicts the item under a second lighting condition; and based onthe determining, providing an indication that the item depicted in thefirst image is also depicted in the second image under the secondlighting condition.
 12. A method comprising: receiving item listing datathat includes at least image data and textual data, wherein the imagedata comprises an image that includes a plurality of visualcharacteristics, and the textual data comprises a textual input to beapplied to the image; identifying an item depicted by the image based onthe one or more edges of the foreground portion of the image; performinga comparison of the item depicted by the image and the textual input tobe applied to the image; determining, based on the comparison of theitem depicted by the image and the textual input, that the item depictedby the image does not correlate with the textual input; and causingdisplay of a notification at the client device in response to thedetermining that the item depicted by the image does not correlate withthe textual input, the notification comprising a presentation of arequest for an alternate image, the presentation of the requestcomprising a display of a text string that includes the textual input.13. The method of claim 12, wherein the identifying the item depicted bythe image based on the plurality of visual characteristics includes:detecting one or more edges of the item based on the plurality of visualcharacteristics; and detecting the item based on the one or more edgesof the item.
 14. The method of claim 12, wherein: the visualcharacteristic is a brightness of the foreground portion of the image,and the set of scores includes a score of the brightness of theforeground portion of the image.
 15. The method of claim 12, wherein:the visual characteristic is a contrast level of the foreground portionof the image, and wherein the set of scores includes a score of thecontrast level of the foreground portion of the image.
 16. The method ofclaim 12, wherein: the visual characteristic is a saturation of thefirst foreground portion of the image, and wherein the set of scoresincludes a score of the saturation of the foreground portion of theimage.
 17. The method of claim 12, wherein: the visual characteristic isan aspect ratio of the foreground portion of the image, and wherein theset of scores includes a score of the aspect ratio of the foregroundportion of the image.
 18. The method of claim 12, wherein: the visualcharacteristic is a uniformity of the foreground portion of the image,and wherein the set of scores includes a score of the uniformity of theforeground portion of the image.
 19. A non-transitory machine-readablestorage medium comprising instructions that, when executed by one ormore processors of a machine, cause the machine to perform operationscomprising: receiving item listing data that includes textual data, andimage data from a client device, wherein the image data comprises aplurality of visual characteristics, and the textual data comprises atextual input to be applied to the image; identifying an item depictedby the image based on the plurality of visual characteristics;performing a comparison of the item depicted by the image and thetextual input to be applied to the image; determining, based on thecomparison of the item depicted by the image and the textual input, thatthe item depicted by the image does not correlate with the textualinput; and causing display of a notification at the client device inresponse to the determining that the item depicted by the image does notcorrelate with the textual input, the notification comprising apresentation of a request for an alternate image, the presentation ofthe request comprising a display of a text string that includes thetextual input.
 20. The non-transitory machine-readable storage medium ofclaim 19, wherein the identifying the item depicted by the image basedon the plurality of visual characteristics includes: detecting one ormore edges of the item based on the plurality of visual characteristics;and detecting the item based on the one or more edges of the item.